What Is Coreference, And What Should Coreference Annotation Be?
نویسندگان
چکیده
In this paper, it is argued that 'coreference an-notation', as currently performed in the MUC community, goes well beyond annotation of the relation of coreference as it is commonly understood. As a result, it is not always clear what semantic relation these annotations are actually encoding. The paper discusses a number of interrelated problems with coreference annotation and concludes that rethinking of the coreference task is needed before the task can be expanded (e.g., to cover part/whole relations) as has recently been advocated. As a step towards solution of the problems with coreference annotation , one possible simplification of the annotation task is suggested. This strategy can be summed up by the phrase "Coreference annotation should annotate coreference relations, and coreference relations only". 1 Introduction: Coreference Annotation Various practical tasks requiring language technology including, for example, information extraction and text summarization, can be done more reliably if it is possible to automatically find parts of the text containing information about a given topic. For example, if a text sum-marizer has to select the most important information , in a given text, about the 1984 Wall Street crash, then the summarization task is greatly helped if a program can automatically spot all the clauses in the text that contain information about this crash. To 'train' a program of this kind, extensive language corpora have been prepared in which human readers have annotated what has been called the coref-erence relation. These annotated corpora are then used as a 'gold standard' against which the program's achievements can be compared.
منابع مشابه
ITRI-00-32 On Coreferring: Coreference in MUC and Related Anotation Schemes
In this paper, it is argued that 'coreference' annotations, as performed in the MUC community for example, go well beyond annotation of the relation of coreference proper. As a result, it is not always clear what semantic relation these annotations are encoding. The paper discusses a number of problems with these annotations and concludes that rethinking of the coreference task is needed before...
متن کاملOn Coreferring: Coreference in MUC and Related Annotation Schemes
In this paper, it is argued that "coreference" annotations, as performed in the MUC community for example, go well beyond annotation of the relation of coreference proper. As a result, it is not always clear what semantic relation these annotations are encoding. The paper discusses a number of problems with these annotations and concludes that rethinking of the coreference task is needed before...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملCorefrence resolution with deep learning in the Persian Labnguage
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
متن کاملAcquiring Domain Specific Knowledge and Coreference Cues for Coreference Resolution
The goal of a coreference resolution (CR) system is to take unstructured text and produce coreference chains of all the entities within it. This task is normally broken down into subordinate steps, such as detecting what entities can participate in coreference relations and what links between entities can be determined. This research attempts to address several problems found in general corefer...
متن کامل